Pooling Robust Shift-Invariant Sparse Representations of Acoustic Signals

نویسندگان

  • Po-Sen Huang
  • Jianchao Yang
  • Mark Hasegawa-Johnson
  • Feng Liang
  • Thomas S. Huang
چکیده

In recent years, designing the coding and pooling structures in layered networks has been shown to be a useful method for learning high-level feature representations for visual data. Yet, such learning structures have not been extensively studied for audio signals. In this paper, we investigate different pooling strategies based on the sparse coding scheme and propose a temporal pyramid pooling method to extract discriminative and shiftinvariant feature representations. We demonstrate the superiority of our new feature representation over traditional features on the acoustic event classification task.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Translation Invariant Approach for Measuring Similarity of Signals

In many signal processing applications, an appropriate measure to compare two signals plays a fundamental role in both implementing the algorithm and evaluating its performance. Several techniques have been introduced in literature as similarity measures. However, the existing measures are often either impractical for some applications or they have unsatisfactory results in some other applicati...

متن کامل

Translation Invariant Approach for Measuring Similarity of Signals

In many signal processing applications, an appropriate measure to compare two signals plays a fundamental role in both implementing the algorithm and evaluating its performance. Several techniques have been introduced in literature as similarity measures. However, the existing measures are often either impractical for some applications or they have unsatisfactory results in some other applicati...

متن کامل

Learning An Invariant Speech Representation

Recognition of speech, and in particular the ability to generalize and learn from small sets of labeled examples like humans do, depends on an appropriate representation of the acoustic input. We formulate the problem of finding robust speech features for supervised learning with small sample complexity as a problem of learning representations of the signal that are maximally invariant to intra...

متن کامل

Airplane detection based on rotation invariant and sparse coding in remote sensing images

Airplane detection has been taking a great interest to researchers in the remote sensing filed. In this paper, we propose a new approach on feature extraction for airplane detection based on sparse coding in high resolution optical remote sensing images. However, direction of airplane in images brings difficulty on feature extraction. We focus on the airplane feature possessing rotation invaria...

متن کامل

Learning An Invariant Speech Representation by

Recognition of speech, and in particular the ability to generalize and learn from small sets of labeled examples like humans do, depends on an appropriate representation of the acoustic input. We formulate the problem of finding robust speech features for supervised learning with small sample complexity as a problem of learning representations of the signal that are maximally invariant to intra...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012